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Abstract. Simbad is the reference database for identifi- 
cation and bibliography of astronomical objects. It con- 
tains identifications, 'basic data', bibliography, and se- 
lected observational measurements for several million as- 
tronomical objects. 

Simbad is developed and maintained by CDS, Stras- 
bourg. Building the database contents is achieved with the 
help of several contributing institutes. Scanning the bibli- 
ography is the result of the collaboration of CDS with bib- 
liographers in Observatoire de Paris (DASGAL), Institut 
d'Astrophysique de Paris, and Observatoire de Bordeaux. 

When selecting catalogues and tables for inclusion, pri- 
ority is given to optimal multi- wavelength coverage of the 
database, and to support of research developments linked 
to large projects. In parallel, the systematic scanning of 
the bibliography reflects the diversity and general trends 
of astronomical research. 

A WWW interface to Simbad is available at: 

http: / /simbad. u-strasbg.fr/Simbad. 
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1. Introduction 

1.1. The CDS 

The Centre de Donnees astronomiques de Strasbourg 
(CDS) defines, develops, and maintains services to help 
the astronomers find the information they need from the 
very rapidly increasing wealth of astronomical informa- 
tion, and particularly of on-line information. 

CDS is operated at the Strasbourg astronomical Ob- 
servatory, under an agreement between French Institut 
National des Sciences de l'Univers (INSU) and Universite 
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Louis Pasteur, Strasbourg (ULP). CDS personnel created 
and implemented the Simbad data bank and maintain its 
data and software system. 

A detailed description of t he CD S on-line services can 
be found, e.g., in Egret et al. (1995) and in Genova et al. 
( |1996| , |1998| |200C|) , or at the CDS web siteQ. Questions or 
comments about the CDS services can be sent to the hot 
line question@simbad.u-strasbg.fr. 

1.2. SIMBAD 



The Simbad database contains identifications, 'basic 
data', bibliographical references, and selected observa- 
tional measurements for more than 2.7 million astronom- 
ical objects (November 1999). Data and information pub- 
lished in Simbad come from selected catalogues and tables 
and from the whole astronomical literature. 

The specificity of the Simbad database is to organize 
the information per astronomical object, thus offering a 
unique perspective on astronomical data. This is done 
through a careful cross-identification of objects from cat- 
alogues, lists, and journal articles. The ability to gather 
together any sort of published observational data related 
to stars or galaxies has made Simbad a key tool used 
worldwide for all kinds of astronomical studies. 

Simbad is the acronym for Set of Identifications, 
Measurements and Bibliography for Astronomical Data. 

The main access point to Simbad is the WWW home 
pagef]; there is a mirror copy at SAO, Harvard^. 

1.3. Historical background 

Building a reference database for stars - and, later, for 
extragalactic objects and all astronomical objects outside 
the Solar System - has been the first goal of the CDS: Sim- 
bad is the result of an on-going effort which started soon 
after the creation of CDS in 1972. Simbad was created 
by merging the Catalog of Stellar Identifications (CSI, 
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Ochsenbein et al. |1981 ) and the Bibliographic Star Index 
(Ochsenbcin |1982| ) as they existed until 1979. The result- 
ing data base (at that time, about 400,000 objects, mainly 
stars) was then expanded by the addition of source data 
from the catalogs and tables, and by new literature refer- 
ences. The database was extended to galaxies and other 
non-stellar objects in 1983 (Dubois et al. 198of ). For de- 
tails about the early developments of Simbad see Egret 

The first on-line interactive version of Simbad was re- 
leased in 1981, and operated at the Strasbourg Cronen- 
bourg computer center until December 1984, when it was 
moved to Universite Paris-Sud at Orsay, and operated 
there until June 30, 1990. The database is now hosted 
on a Unix server, at the Strasbourg Observatory. 

The original command line interface has been comple- 
mented by an interactive X- Window interface (XSimbad) 
in 1994, and by a World-Wide Web interface in 1996. 
There is also a client /server mode, providing quick re- 
sponses to simple queries, essentially for the name resolu- 
tion in archives and information systems (see Section ||) . 

For descrip tions of earlier stages of the d atabase, see 
Heck & Egret (|l986|) , and Egret et al. (|l99l|) . 



2. SIMBAD main features 

Simbad is, in the first place, a database of identifications, 
aliases and names of astronomical objects: in principle any 
name found in the literature - provided it is given as a 
syntactically correct character string - can be submitted 
to Simbad in order to retrieve basic information known 
for this object, as well as pointers to complementary data 
and bibliography. This implies a continuous careful cross- 
identification of objects from catalogues, lists, and journal 
articles. This ability to gather together any sort of pub- 
lished observational data related to stars or galaxies is the 
first key feature of Simbad. 

Another unique feature is the complete and up-to-date 
bibliographic survey of the astronomical literature: objects 
are associated with the references of all papers in which 
they are mentioned, independently of the aliases used to 
name the object. 

In addition, the Dictionary of Nomenclature (Section 
|8|) is an essential tool for managing the very complex 
nomenclature of objects found in the literature, and for 
matching naming variations with those adopted or simply 
accepted by Simbad. It also includes hints for helping to 
solve ambiguities, according to the type of object, or to 
the format. This is complemented by the sesame module 
within Simbad, for the management of possible variations 
in the naming of astronomical objects. 

The database management system of Simbad (Section 
||) has been developed in-house at CDS, using the concepts 
of object-oriented programming. 

Simbad is kept up-to-date (Section 0) on a daily basis, 
as the result of the collaboration of CDS with bibliogra- 



phers in Institut d'Astrophysique de Paris and the Paris 
and Bordeaux observatories. 

The statistical contents of the database (Section |||) 
can be summarized in a few figures as follows (the figures 
quoted are statistics of November 1999): 

— entries for about 2.7 million astronomical objects 
(stars, galaxies and all astronomical objects outside 
the solar system); 

— a cross-index of 7.5 million identifiers related to 4500 
astronomical catalogues and tables, lists, and observa- 
tion logs of space missions; 

— observational data from some 25 different types of data 
catalogues and compilations; 

— a bibliographic survey covering the astronomical litera- 
ture since 1950 for stars, and since 1983 for extragalac- 
tic objects: more than 3 million citations from 110,000 
different papers. 

3. SIMBAD astronomical contents 

3.1. The objects 

The Simbad data base presently contains information for 
about: 

— 1,500,000 stars; 

— 450,000 galaxies; 

— 100,000 other non-stellar objects (planetary nebulae, 
clusters, Hn regions, etc.); 

— and some 650,000 additional objects observed at vari- 
ous wavelengths (Radio, IR, X), and for which classi- 
fication is not yet assigned. 

The only astronomical objects specifically excluded 
from Simbad are the Sun and Solar System bodies. 

The Simbad database is primarily organized per as- 
tronomical object. The aim is to provide, as much as pos- 
sible, the user with all published information (identifica- 
tions, observational data, bibliographical references, and 
pointers towards external archives) concerning any given 
object, or list of objects. 

The two main channels for feeding the database are 
the following: 

— the daily scanning of papers published in the astro- 
nomical literature provides new references and new 
identifiers for existing objects, as well as opportunities 
to create new objects, using the basic data possibly 
given in the article; 

— the complete (or partial) folding of selected catalogues 
into the database serves as a basis for improving the 
completeness and multi-wavelength coverage of the 
database. 

Catalogues are selected for integration with priority 
given to those which can help provide an optimal support 
for the large projects conducted within the astronomical 
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community. A large effort was, for instance, devoted in 
recent years to stellar catalogues (PPM, HIC, CCDM), 
in the context of the Hipparcos project, and to multi- 
wavelength identifications (IRAS PSC, Einstein IE and 
2E catalogues, older X-ray catalogues, the IUE Merged 
Log, etc.). The Hipparcos and Tycho catalogues (ESA 



1997) have been recently included, and inclusion of the 



ROSAT All Sky Survey is planned in the near future. 

In parallel, the systematic scanning of the bibliogra- 
phy reflects the diversity and general trends of research 
in astronomy, and takes into account shorter lists. The 
published lists from the microlensing surveys, or e.g. the 
EUVE catalogues, were folded into the database as a re- 
sult of this scanning. 

When an object is found in the literature or in a cata- 
logue, its possible cross-identification with objects already 
in SiMBAD is systematically studied, before entering the 
reference and the new object name in the database. About 
4500 different names of catalogues or object lists from pub- 
lished papers can currently be found in SiMBAD, covering 
all the wavelength domains from high energy astrophysics 
to radio. 

When no proper name is suggested by the authors, or 
when the acronym generates an ambiguity with already 
existing ones, the current practice, shared with the NED 
database, is to create an acronym within brackets using 
the initials of the last names of the first three authors, 
and the year of publication. For example, [HFE83] 366 is 
the 366th entry in the main table of a paper by Helmer, 
Fabricius, Einicke and colleagues published in 1983. From 
the year 2000 on, the year will be noted with four digits 
(e.g., [ABC2000]). 

Many objects have more than one name, since the 
database contains more than 7.5 million object names for 
2.7 million objects. Examples of objects with more than 
50 identifiers, are the galaxy M 87 in Virgo, the bright 
stars Procyon and Capella, the quasar 3C 273, the Crab 
Nebula. 

To help the users with the complex nomenclature of 
astronomical objects, the CDS now maintains and dis- 
tributes on-line the Dictionary of Nomenclature of Ce- 
lestial Objects (first developed by Lortet et al. 1994 ; see 
Section |). 

3.2. The Data 

In the following, the word object will be used to designate 
a star, non stellar object, or collection of objects such as a 
cluster, which corresponds to an individual entry in Sim- 
bad. For each object, the following data are included when 
available: 

— Basic data: 

stars : object type, coordinates, proper motion, B and 
V magnitudes, spectral type, parallax, radial veloc- 
ity; 



galaxies : object type, coordinates, blue and visual in- 
tegrated magnitudes, morphological type, dimen- 
sion, radial velocity or redshift. 

other : object type, position, B and V magnitudes. 

— Cross-identifications from some 4500 catalogues and 
tables, either completely or partially included in the 
data base. 

— Observational data (also called measurements), for 
some 25 data types. A list of these types is given in 
the Table |. 

— General bibliography, including references to all pub- 
lished papers since 1983 citing the object under any of 
its designations. For stars, the bibliography starts as 
early as 1950, but with a smaller coverage of the liter- 
ature. SiMBAD also includes a few hundred references 
before 1950, but without any systematic trend. The 
bibliography gives access to abstracts and electronic 
articles when available (either directly from publish- 
ers, or through ADS). Currently about 100 journals 
covering the complete astronomical literature are reg- 
ularly scanned. A complete list is available on line^]. 

In the following, a more detailed description of some 
of these elements is given. 



3.2.1. Object type 

The object type refers to a hierarchical classification of 
the objects in Simbad, derived by the CDS team on the 
basis of the catalogue identifiers (as proposed by Ochsen- 
bein and Dubois 1992| ). From Star to Maser source, or 



Cluster of Galaxies, some 70 different categories, general, 
or very specific, are proposed (see examples in Table Q). 
A complete list is available on lino[]. 

This classification is intended to help the user select 
objects out of the dat abase (e.g. through the filter proce- 
dure, see Section 4.5). It is also a powerful tool for data 
cross-checking and quality control. It has been designed 
to be practical and useful, and complements other features 
also available in Simbad (morphological type or spectral 
type information, catalogues, and measurements). It can 
follow the evolution of astronomy, with the introduction 
of new categories recently appeared in the literature (e.g., 
in the last years, Low-Mass or High-Mass X-Ray binary, 
Microlensing event, or Void). 

Each class has normally a standard designation, a con- 
densed one (used in tables) and an extended explanation. 
The classification uses a hierarchy with four levels, reflect- 
ing our knowledge of the characteristics of the astronom- 
ical object. For instance, an object can be classified as a 
"Star" (this is level 1). If photometric observations have 
shown variability of the object, it can be classified as a 
"Variable star" (this is level 2) . Examples of level 3 and 4 
are "Pulsating variable" , and "Cepheid" . 



4 http:/ /simbad. u-strasbg.fr/guide/chH.htx 
http:/ /simbad. u-strasbg.fr/guide/chF.htx 
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This hierarchy of object types (and their possible syn- 
onyms) is managed in the database in such a way that 
selecting variable stars (V*) is understood as selecting ob- 
jects classified as V*, and all subdivisions (e.g. PulsV*, 
Mira, Cepheid, etc.). If the user is only interested in RR 
Lyrae type stars, he/she will use the RRLyr type, leaving 
aside all other variable stars for which the variability mode 
is different, or not known. 

The classification emphasizes the physical nature of 
the object rather than a peculiar emission in some region 
of the electromagnetic spectrum or the location in peculiar 
clusters or external galaxies. Therefore objects are classi- 
fied as peculiar emitters in a given wavelength (such as 
UV or IR source) only if nothing more about the nature 
of the object is known, i.e. it cannot be decided on the 
sole basis of the basic data whether the object is a star, 
a multiple system, a nebula, or a galaxy. For instance, if 
an object appears only in the IRAS catalogue, it is auto- 
matically classified as IR object: it is left to the user to 
decide to go further and to derive, e.g. on the basis of the 
IRAS colors, the probability for the source to be stellar or 
extragalactic. 

Because there is at most one object type per object, 
this classification should be used with caution when ex- 
tracting samples out of the database. This is typically the 
case for the wavelength types: using IR or X as a crite- 
rion cannot generate a sample of all IRAS sources, or all 
X-ray emitting objects, since a number of them are in fact 
classified as stars, galaxies, etc. 

3.2.2. Coordinates, Proper motion, Parallax, and Radial 
Velocity or Redshift 

The coordinates were originally stored in the database in 
the FK4 system for equinox and epoch 1950.0. A major 
change was undergone in 1999, when they were moved 
to the International C elestia l Reference System (ICRS, 
see Feissel & Mignard |l998|) at epoch 2000.0, after the 
publication of the Hipparcos and Tycho catalogues. The 
position data frame has become more complex, grouping 
together all data needed for computing the coordinates 
into any reference frame, at any epoch and equinox: the 
coordinates themselves, the proper motion, the parallax 
and the radial velocity or redshift. 

All these data contain the same subfields: the original 
data, displayed with a number of digits consistent with the 
announced precision of the data; a quality code from 'A' 
(reference data) to 'E' (unreliable origin); an error box (ei- 
ther a standard error, or an ellipse), and the bibliographic 
reference of the data. 

In earlier versions of Simbad, the determination of the 
position for another equinox used to take only precession 
into account. In the current version, a change of equinox 
takes into account not only the precession but also the 
proper motion, the reference frame (FK4, FK5, ICRS), 
and, when they are known, the parallax and radial veloc- 



Table 1. Object type classification: extracts from the ob- 
ject type table illustrating examples of the four levels of 
the classification scheme 



Level 


Standard 


Short 


Extended Explanation 




name 


name 




1 


Star 


* 


Star 


2 


*inCl 


*iC 


Star in Cluster 


2 


*inNeb 


*iN 


Star in Nebula 


2 


nnAssoc 


*iA 


Star in Association 


2 


*in** 


*j* 


Star in double system 


2 


V*? 


V*? 


Star suspected of Variability 


2 


Pec* 


Pe* 


Peculiar Star 


Q 
O 


riD 


ill) . 

riD 


Horizontal Branch Star 


3 


YSO 


Y*0 


Young Stellar Object 


3 


Em* 


Em* 


Emission-line Star 


4 


Be* 


Be* 


Be Star 


1 


Galaxy 


G 


Galaxy 


2 


PartofG 


PoG 


Part of a Galaxy 


2 


GinCl 


GiC 


Galaxy in Cluster of Galax- 


2 


GinGroup 


GiG 


ies 

Galaxy in Group of Galaxies 


2 


GinPair 


GiP 


Galaxy in Pair of Galaxies 


2 


Highj.G 


HzG 


Galaxy with high redshift 


2 


AGN 


AGN 


Active Galaxy Nucleus 


3 


LINER 


LIN 


LINER-type Active Galaxy 








Nucleus 


3 


Seyfert 


SyG 


Seyfert Galaxy 


4 


Seyfert_l 


Syl 


Seyfert 1 Galaxy 


4 


Seyfert_2 


Sy2 


Seyfert 2 Galaxy 


3 


Blazar 


Bla 


Blazar 


4 


BLLac 


BLL 


BL Lac - type object 


4 


OVV 


OVV 


Optically Violently Variable 


3 


QSO 


QSO 


Quasar 



ity. When no epoch is specified, the year of the equinox is 
used by default. 

Data come from various sources. When astrometric 
data are available, the most accurate one has been se- 
lected for the 'basic data'. Other values may be available 
as measurements (in t he po s type). The Hipparcos and 
Tycho catalogues (ESA 1997 ) constitute the major source 
of positions for stars. 

The coordinates precision may vary from 1° to 1/10 
mas. The default display format provides equatorial coor- 
dinates in the ICRS system at epoch 2000.0, and in the 
FK5 system at equinoxes 2000 and 1950, as well as galactic 
coordinates. Coordinates in the FK4 system, and ecliptic 
or super-galactic coordinates can be computed on request. 

The proper motions cos<5, us) are given in 
mas/year, together with their standard errors (in 
mas/year). The primary source of proper motions is the 
Hipparcos and Tycho catalogues (ESA 1997). 

The errors for positions or proper motions are ex- 
pressed as error ellipses, made of three numbers, within 
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brackets: the major axis, the minor axis, and the position 
angle of the major axis (measured from North to East). 
Major and minor axes are expressed in mas for the posi- 
tion, and mas/yr for the proper motion; the position angle 
is expressed in degrees, in the range [0°, 180° [. 

When available, the stellar parallax is given in mas, to- 
gether with the associated error within brackets. The pri- 
mary source is the Hipparcos and Tycho catalogues (ESA 



19971 ) 



Radial velocity (in km/s), or redshift (for extragalactic 
objects) are currently available for some 160,000 objects. 
They are stored in their original type (either redshift, or 
radial velocity in km/sec), associated with the standard 
error. Display can be done in the original type or forced 
to be one of the two types, using the corresponding trans- 
lation formula. 

Stellar radial velocity data have been compiled with 
the collaboration of Observatoire de Marseille. 

For extragalactic objects, up-to-date redshift informa- 
tion has recently been imported from the NASA/IPAC 
Extragalactic Database (NED, Hclou ct al. 2000 ) as a re- 
sult of the ongoing exchange agreement: the Simbad team 
is providing NED with bibliographic coverage of extra- 
galactic objects for all astronomical journals, and is being 
given access, in return, to extragalactic data collected by 
NED. 

Tables from individual articles constitute the other 
major source of information. 

3.2.3. Magnitudes 

B and V magnitudes are given, when possible, in the John- 
son's U BV system. Both magnitudes may be followed by 
a semicolon meaning they cannot be made homogeneous 
to the UBV system. In addition the following flags may 
appear: 

— a 'D' flags a joint magnitude in a double or multiple 
system; 

— a 'V indicates a variable magnitude and is followed 
by a coded index giving a rough estimate of the am- 
plitude: 



code 


definition 


1 


1/100 mag. 


2 


1/10 mag. 


3 


1 mag. 


4 


more than 1 mag. 


? 


suspected variable 



When possible the magnitudes have been taken from 



the Tycho Reference Catalogue (H0g et al. 1998) where 
B and V magnitudes are derived from the original Bt 
and Vt- Ano ther major source is the UBV compilation of 
Mermilliod ( 1987j ). Otherwise the data would come from 
one of the published papers associated to the object. 



3.2.4. Stellar Spectral type 

The spectral types of stars have been selected preferably 
in the Michigan Catalogues of T wo-Di mensional Spec- 
tral Types for the HD stars (Houk [1975 , and seq.), or in 
the bibliographical surveys of MK classifications (Jaschek 
|1978 ). In the absence of a full MK classification, the HD 
spectral type is recorded. 

Most of the spectral types need less than 5 characters, 
but this field can be as long as 36 characters. 



3.2.5. Morphological type and Dimension of galaxy 

The morphological types of galaxies have been selected 
primarily from the Uppsala General Catalogue of Galax- 
ies (UGC, Nilson 1973 ), the Morpholog ical C atalogue of 
Galaxies (MCG, Vorontsov-Velyaminov, 1962), and other 
catalogues (see Dubois et al. 1983 ). 

In complement, the following data, primarily from 
UGC, are given, when available, for galaxies: 



log D25 logarithm of the major axis a expressed 

in tenths of arc minutes; 
logi?25 logarithm of the ratio a/6 where a and 

b are the major and minor axis; 
orientation orientation angle (in degrees) 
(inclination) inclination (in units of 15° from to 7) 

3.3. Cross-identifications 
3.3.1. Aliases 

Cross-identifications of stars and galaxies have been 
searched for Simbad entries from (currently) about 4500 
source catalogues and tables, included, either completely 
or partially, in the data base. The index of 7.5 million 
aliases, thus constituted, is one of the unique features of 
the Simbad database. 

Aliases may serve as entry points for related catalogues 
and tables (e.g. in VizieR). Cross- fertilization of a given 
research with previous studies of the same object pub- 
lished in the astronomical literature is made directly pos- 
sible from the alias list. 

The index of names and aliases constitutes the basis 
for the Simbad name resolver which provides, in response 
to any object name, the set of coordinates corresponding 
to the object position on the celestial sphere, or the list 
of papers citing the object. The name resolving power of 
Simbad is used by many archives and information systems 
(such as the archives of Hubble Space Telescope or Euro- 
pean Southern Observatory, the High Energy Astrophysics 
Science Archive Center, the Astrophysics Data System, 
servers of the Digitized Sky Surveys, etc.). 
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There is no Simbad preferred name for objects^]: all 
aliases can be equally used. A short list of major cata- 
logues is used internally to put at the top of the list the 
most common name according to the object type (e.g., 
Messier or NGC identifier for galaxies and nebulae). All 
other identifiers are presented in alphabetical order. 

A command of the Simbad native node ('selectid'), 
and an option in the sampling form of the WWW inter- 
face, allow the user to impose a list of identifiers to be 
used when displaying object lists. 

3.3.2. Multiple systems 

It is to be noted that for a double system in which the com- 
ponents can be observed separately, Simbad frequently 
includes three entries: A and B components, and an ad- 
ditional entry for the joint system (AB), the latter entry 
carrying the observational data and references related to 
the system as a whole. This has to be taken into account 
in statistical studies such as stellar counts. 

3.4- Observational data 

Observational data are presently given for the measure- 
ment types listed in Table ||. 

For each data type, one can retrieve individual data 
with their bibliographical references, and, when available, 
weighted means computed from existing observed values 
by specialists in the related field. 

When measurements are listed as a result of a Simbad 
query, they are normally preceded by a header providing 
a very short title to each listed parameter. 

The important role now played by the VizieR 



Table 2. List of measurement types currently included 
in Simbad, together with the number of entries (October 
1999). 



database of catalogues (Ochsenbein et al. |2000| ), com- 
ing with easier interoperability of services, is changing 
the strategy for inclusion of observational measurements 
into Simbad. Let us take t he ex ample of the Hipparcos 
and Tycho catalogues (ESA |l997|) : once the HIP or TYC 
identifier is available from Simbad it appears convenient 
enough to provide the user with a WWW link to the corre- 
sponding data in VizieR rather than overloading the Sim- 
bad database with the full Hipparcos and Tycho cata- 
logues. This functionality is currently being implemented 
for important catalogues which have already been cross- 
identified. 

As a complement, the WWW interface includes point- 
ers to external archives, currently: the I NES d atabase of 
the IUE project (Rodriguez-Pascual et al. 1999 ); the high- 
energ y obs ervational archives at heasarc (HEASARC 
team |l995| ). 



Name 


Observational data 


# 


CEL 


Ultraviolet photometry from Celescope 


5230 


Cl.G 


Clusters of Galaxies (Abell &z Corwin 


5345 




1989) 




Einstein 


Einstein Observatory Soft X-ray 


5668 




Source List 




GEN 


U BV B 1 B 2 V\G Geneva photometry 


3650 


GJ 


Absolute magnitudes and spatial veloc- 


2368 




ities of nearby stars 




Hbet 


Bp index 


32278 


HGAM 


Bj equivalent width 


723 


IRAS 


[RAS Point Source Catalog 


245784 


IRC 


KI photometry from Two Micron Sky 


4880 




Survey 




IUE 


International Ultraviolet Explorer 


66805 




(Merged Observation Log) 




JP11 


UBVRIJKLMNB 11-colour Johnson 


5892 




photometry 




MK 


Stellar spectral classification in 


190231 




Morgan-Keenan system 




oRV 


Stellar Radial velocities (also under 


68783 




GCRV) 




PLX 


Trigonometric parallaxes 


16329 


pm 


Proper motions (from various astro- 


243065 




metric catalogues) 




pos 


Positions (from various astrometric 


668953 




catalogues) 




ROT 


Rotational velocities (Vsini) 


7181 


RVEL 


Radial velocities of extragalactic ob- 


36552 


jects 




SAO 


Positions and proper motions from 


252384 




SAO catalogue 




TD1 


Ultraviolet magnitudes from TD1 


25972 




satellite 




UBV 


Johnson UBV photometry 


141215 


uvby 


Stromgren uvby photometry 


37986 


V* 


Data related to variable stars 


25764 


z 


Redshifts (of distant galaxies and 


88888 




quasars) 





3.5. Bibliographical references 

One of the key features of the Simbad astronomical 
database is the unique coverage of bibliographical refer- 
ences to objects. The bibliographic index contains refer- 
ences to stars from 1950 onwards, and to galaxies and all 
other objects outside the solar system from 1983 onwards. 
Presently (November 1999) there are about 3 million ref- 
erences taken from 110,000 papers published in the 100 
most important astronomical periodical publications. 



6 In the early tim es of the Catalog of Stellar Identifications 
(Ochsenbein et al. 1981), the Durchmusterung number had 
been used as a preferred name for stars. 



3.5.1. Bibliographical data 

Articles are scanned in their entirety, and references to 
all objects mentioned in the title, in the abstract, in the 
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text, in the figures, or in the tables are included in the 
bibliography. Tables larger than 1000 objects are usually 
considered as catalogues and processed separately. 

No assessment is made of the relevance of the cita- 
tion in terms of astronomical contents: the paper can be 
entirely devoted to the object, or simply give a side men- 
tion of it - in both cases this gives a reference in Si mbad . 
Note that, for instance, the NED team (Helou et al. 2000 ) 
applies a different strategy when covering bibliography of 
extragalactic objects, and tends to select only those pa- 
pers that appear most relevant. Clearly, SiMBAD approach 
favours exhaustivity, at the cost of increased information 
noise. 

A code (nicknamed bibcode) is assigned to each con- 
sidered paper: this 19-digit bibcode contains in principle 
enough information to locate the article (including year of 
publication, journal, volume, page, etc.). 

When one retrieves the bibliography of a Simbad ob- 
ject, a list of codes is usually given, and - according to 
the options used - these codes are automatically matched 
against a bibliographic file which provides the full refer- 
ence, title and list of authors for each citation, together 
with an anchor pointing to the electronic version of the 
article. 

Currently, in Simbad, about 50% of the objects have 
no bibliographic reference. Among the most cited objects 
(more than 2000 references) are the Large Magellanic 
Cloud, M 31, 3C 273, and the supernova SN 1987A. 

3.5.2. Bibliographic reference coding convention 

The structure of the 19-digit bibcode has been defined in 
close collaboration with the NED group at NASA/IPAC 
so that both d ataba ses share the same coding system 
(Schmitz et al. 



19951) 



It is also used, with some adjust- 
ments, by the Abstract Ser vice o f the Astrophysics Data 
System (ADS, Kurtz et al. 2000| ), and by the electronic 



journals (see e.g., Boyce & Dalterio |l996 ). Reference codes 
have the following general structure: 

YYYYJJJJJVVVVMPPPPA 

YYYY Year of the publication. 

JJJJJ Standard abbreviation for the periodical. 

WW Volume number (for a journal) or, in the second 
character of this field, one of the following abbrevia- 
tions for another publication: B (book), C (catalogue), 
P (preprint), R (report), S (symposium), T (thesis), U 
(unpublished). 

M Specific qualifier for a paper: 
L letter 

p pink page (in MNRAS) 
a-z issue number within a volume 
A-K issue designation used by publisher 
Q-Z to distinguish articles on the same page. 
PPPP Page number. 



A First letter of the first author's last name (or ':' if the 
first author cannot be identified). 

Example: 1991A&A. . .246L. .24M for Astron. Astro- 
phys. 246, L24, 1991, a Letter to the Editor of Astronomy 
& Astrophysics, by Motch et al. 

For a complete description see Schmitz et al. ( 1995 ), 
or the WWW server^. 

3.5.3. Comments in the references 

Several types of comments are associated with the refer- 
ences in Simbad and normally displayed after the refer- 
ence: 

— General comments: they are often comments added by 
the bibliographers, about the problems encountered 
while cross-identifying the objects mentioned in the 
paper, typos in object names, etc. 

— Notes about the existence of associated electronic ta- 
bles, or abstracts in the CDS server. Papers including 
no object are also flagged. 

— Information on how the quoted objects are named 
in Simbad (comments related to the Dictionary of 
Nomenclature of Celestial Objects). 

3. 6. Statistical aspects of the Data Contents 

The astronomical content of Simbad results from the com- 
plex process of folding into the database a selection of 
important catalogues, and of surveying the complete as- 
tronomical literature. 

This can be illustrated by the histogram in V mag- 
nitudes of Figure [|. The coverage is reasonably complete 
up to beyond magnitude 10 for stars, after the inclusion 
of the Tycho catalogue. Many objects in the range 12 to 
26th mag. come from extensive studies of objects in se- 
lected sky areas: deep fields, external galaxies, etc. 

Some well-known very large catalogues are not part 
of Simbad: for instance the Hubble Telescope Guide Star 



Catalogue (GSC, Lasker et al. 1990) is not systematically 
included (even if GSC identifiers have been added for all 
Tycho stars present in Simbad). This results from a com- 
promise aiming to save database load as well as man- 
power for cross-identification and quality control. Note 
that VizieR and Aladin give access to the full GSC cat- 
alogue (and to even larger catalogues and databases such 
as USNO-A, DENIS, 2MASS). 

Fig. H illustrates the increase of the data contents of 
the database in the years 1990 to 1999. 

4. SIMBAD structure and query management 

Simbad query mechanism can be summarized by the fol- 
lowing key features: 

7 http:/ /cdsweb.u-strasbg.fr/simbad/refcode.html 
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Fig. 1. Histogram of the number of objects in Simbad per 
magnitude interval (V magnitude; logarithmic scale). 



Fig. 2. Histograms of different types of database entries 
for the years 1990 to 1999: number of objects (top left), 
number of identifiers (top right), number of references 
(bottom left), number of citations (bottom right). The 
numbers given are the total numbers of entries present in 
the database at the end of the corresponding year. 



— Database queries can be made mainly through: 

— identifiers (names of astronomical objects) and lists 
of identifiers, 

— sets of coordinates (retrieving one object by its po- 
sition on the sky, or extracting all objects lying in 
a given direction), and 

— sampling criteria (or filters). 

— Data output is driven by formats. The user may define 
his/her own formats or modify existing ones. Output 
files can be saved and mailed to the user. 

— The user interface is adaptable to user preferences. 

The database management system of Simbad has been 
developed by the CDS, using the concepts of object- 
oriented programming. 

4-1- Object-oriented concepts 

The command language is using the concepts of objects (or 
agents). Typical object classes are: astronomical object, 
object list, database, session, reference list, filter, format. 
Examples of methods are: display, describe, bye (quit). 

This structure is only visible for the user of the com- 
mand line interface. The WWW interface is rendered quite 
independent of the database structure. 



4-- 2. Indexing 

Simbad is organized for optimized access by identifier 
(through an index table of object names) and by position, 
through an index of small regions. 

Identifiers : A B-tree file contains all identifiers allowing a 
fast access to any of them. For each identifier, a record 
contains a pointer to the astronomical object itself in 
the main database. 

Position : Indexing by coordinates is done in two steps: 
the coordinates are mapped into a set of boxes. Sim- 
bad uses the spherical-cubic projection - a technique 
also used, e.g., for the Cosmic Background Explorer 
(COBE) data: the celestial sphere is projected onto the 
six faces of a cube, giving six boxes at the first level. By 
dividing each face into four parts, one obtains a par- 
tition at level two. Further levels are obtained by fur- 
ther divisions of each box into four sub-boxes. Through 
this mechanism one obtains 6144 boxes at the level 5 
with an average size of 6 square degrees and an aver- 
age number of objects of 500. Box #6145 contains all 
objects without recorded position. 
In order to optimize access to objects in a coordinate 
box, all objects belonging to a box should be physically 
grouped in a common place in the database. This is 
done through a clustering mechanism placing objects 
from the same box in data blocks linked together in 
the database files. 
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When a set of criteria includes some limitations in co- 
ordinates, this generates the definition of a list of boxes 
including the requested area: all entries from these boxes 
are read and checked against the whole set of criteria. 

When a set of criteria includes no limits in sky posi- 
tion, the complete database must be scanned - a long and 
somewhat expensive operation, which takes typically 15 
minutes in the current hardware configuration. 

4-3. Query by identifier 

In principle any name found in the literature - provided it 
is given as a syntactically correct character string - can be 
submitted to the database in order to retrieve information 
known for this object. 

The general syntax of an identifier is the abbreviated 
catalogue name (or acronym: generally one to four charac- 
ters), followed by a number or a name (character string) 
within the catalogue. 

Object names such as Vega and Altair, but also 
Barnard's star, Crab Nebula, Sgr A, HDFN, or HDFS are 
stored in the database in a specific catalog called 'name', 
while star names in constellations, such as a Lyrae, are 
stored in the catalogue '*', and variable stars (such as RR 
Lyrae) in the catalogue 'var' (also called 'V*'). 

The user can generally type Vega, Altair, alf Lyrae 
(or alf Lyr): the sesame name resolving module (Sec- 



File Edit View Go Bookmarks Options Directory Window 



tion ^2 ) is used for guessing the catalogue and making the 
internal conversion. There are however some difficult cases 
in which the name keyword remains necessary, such as in 
NAME SGR 1900+14 where SGR stands for Soft Gamma 
Repeater. 

In addition the following hints can help the user un- 
derstand the best way to submit an identifier to Simbad: 

Case sensitivity: Simbad is not case-sensitive at this 
level: ALF AQL or alf Aql are, for instance, both valid. 
There are some exceptions to the rule, such as the cases 
of the star cluster RMC 136a, or the star in a multiple 
system VdBH 25a A, for which case-sensitivity may be 
necessary for solving format ambiguities. 

Greek letters: should be abbreviated as three letters: alf, 
bet, for a and /3, but also mu. nu. and pi. (with a 
dot), for [i, v and n. 

Constellations: constellation names should be abbrevi- 
ated with the usual three letters: alf Boo, del Sct,FG 
Sge, NOVA Her 1991. The full list is available on-line^. 

Multiple systems: Identifiers of a multiple system may 
generate a list of the objects of the system. For in- 
stance, ADS 5423 calls for the four components, A to 
D, of the stellar system around Sirius. This is true only 
for some specific identifiers. 

Stellar clusters: Clusters which have no NGC or IC 
number are named under the generic appellation 
CI followed by the cluster name and number: e.g., 

8 http:/ /simbad. u-strasbg.fr/guide/chB.htx 



SIMBAD Query Result 



Object query i search m 81 

Basic data : M 81 — LINER-lype Active Galaxy Nucleus 

ICRS 2000.0 coordinates 09 55 33.22 +69 03 55.1 D ~ 

FK5 2000.0/2000.0 coordinates 09 55 33.22 +69 03 55.1 

FK4 1950.0/1950.0 coordinates 09 51 27.36 +69 18 08.3 

Galactic, coordinates 142.09 +40.90 

i .-i. i-i. ■ : n :j i e .~ i- 1 

Bmagn, Vmagn, Peculiarities 8.1, 
Morphological type Sib 
RadialvdociryorRedshift v -49 |>] 

Aladin image : Preview. Interactive Viewer 

Query and Plot around with radius jlO arcmin f Uses first coordinate frame) 



Identifiers (26): 
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K72 21Sa 
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■ :: ■ .. ■ ■:. ■ : ■ : 
TC 714 

1 0951.4+6918 
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coin see also NAME II SI GROUP 
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Measurements : 

Einstein(l) UtAS (1) IUE (13) KVel (5) 



display All measurements 



21M 



Fig. 3. Example of Simbad response page for a query con- 
cerning M81 (only the first part of the response is visible 
here). 



CI Blanco 1 is the 1st stellar cluster named by Blanco. 
Stars in clusters may belong to a 'main' designation 
list, or to subsequent lists. NGC 5272 692 is star 692 
in the list by Von Zeipel, considered as the main list. 
Subsequent lists have designations starting with CI*. 
Examples: CI* NGC 5272 AC 968 (list by Auriere & 
Cordoni); CI* Melotte 25 VA 13 (13th star in the list 
by Van Altena for Melotte 25 - the Hyades cluster); 
CI* Collinder 110 DI 1101 (list by Dawson & Ianna - 
there is no 'main' list for this cluster). More details are 
available in the on-line description^. 
Unknown name ?: If the object name seems unknown to 
Simbad, the user is advised to enter the coordinates 
of the object: the object may actually exist in the 
database under a different designation. Submitting the 
identifier, or the name of the first author of the cat- 
alogue, to the Dictionary of Nomenclature may also 
give useful clues. 

Figure |3| illustrates the response received from the 
database after submitting the identifier 'M 81'. In the 
identifier list, the meaning of acronyms, such as [VDD93], 
is explained through a link to the on-line Dictionary of 
Nomenclature. 
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The user interface provides an option for querying 
around objects, with a radius set by default at 10'. This 
is equivalent to a query by position using the object coor- 
dinates. 

It is also possible to generate the list of 10 or 25 next 
objects following a given identifier, or to submit a list of 
object names, stored in a file with one identifier per line. 

4-4- Query by coordinates 

Query by coordinates can be used to retrieve all objects 
in a circular field defined by the coordinates of the center 
and a radius. 

The coordinates can be replaced by the name of an 
object lying at the center of the field, in which case the 
coordinates are found through an internal query to Sim- 
bad. The radius can have any size (default value is 10'). 
Queries with a radius smaller than 1-2° are answered quite 
instantaneously. 

4-5. Sampling 

The sampling mode (also named filter) allows users to 
define criteria for selecting objects in Simbad. 

The user may extract objects which satisfy one set 
of coordinate criteria, several physical criteria (using a 
simple syntax), objects which have specified identifiers or 
measurements, and, finally, objects having citations within 
a range of years. 

The WWW interface provides an interactive form 
which presents all possible sampling options. 

The resulting list may be ordered according to sort 
criteria and, furthermore, it is possible, through the com- 
mand line mode, to define precisely the output format. 

Note that reading the whole database for extracting 
a sample spread on the whole celestial sphere is possible, 
but quite time-consuming (as mentioned above). The user 
is thus encouraged to test the filter on a limited region of 
the sky, before applying it to the whole database. 

4-6. Charts and sky maps 

After a sampling by position the user can ask for the cor- 
responding sky plot. This feature is only available through 
the WWW interface and is generally optimized for a ra- 
dius range of 10-60 minutes. 

The maps display the objects with different symbols 
according to object type; symbol size for stars varies with 
object magnitude (see Figure ||). The maps are clickable 
and return the object in Simbad corresponding to cursor 
position. 

The WWW interface provides also direct access to the 
Aladin interactive digitized atlas (Bonnarel et al. 2000 ) 
as illustrated in Figure ||. 



Simbad clickable Map 

Coord = 09 55 33.22 +69 03 55.1 
FK5 2000.0 2000.0 
Radius = 10 arcmin 



CDS ■ Simbad ■ VizieR ■ Aladin ■ Catalogues ■ Nomenclature ■ Biblia ■ StarPages ■ AstroWeb 
Click on an object to display its data from Simbad 

(Using the middle button of your mouse will display the result in a new window) 




•••••< 



2 2 3 4 5 6 7-10 11-15 16-26 »20 



: li 6H:54:07 



©ULP/CNRS - Centre de Donnees astronomiques (ie Strasbourg ,'■< ] 

Fig. 4. Example of finding chart around M81 (radius of 
the circular field: 10'). 



4-7. Batch mode 

Simbad can be queried in batch mode, by 
submitting a mail to the special address 
smbmailOsimbad . u-strasbg . f r . 

This is especially useful in case of poor interactive 
connectivity, or for submitting time-consuming queries or 
lists. A WWW form0 helps to prepare the submission. 

4-8. Resolving a bibliographical reference code 

It is possible to obtain a complete bibliographical refer- 
ence, by entering the corresponding reference code (bib- 
code). 

A reference code can be supplied without indicating 
all the fields: the first reference corresponding to the trun- 
cated code will be displayed. An ampersand (&) should be 
added at the end of the truncated bibcode. 



http:/ /cdsweb. u-strasbg. fr/simbad/batch. html 
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and the user needs to have a user name and password 



i 57 06. UV - 



& [PR951 50059 09 54 45.64 +68 56 59.9 ? 14.95 : 13 81 ■ 



jtS"; Unsigned Java Applet Hindow 



Fig. 5. Use of Aladin for displaying Simbad entries (red 
diamonds) on top of a DSS-I/STScI image around M81 
(width of the field: 14.1'). 



If.. 9. Additional tools 

Additional tools include special commands for querying 
auxiliary databases, on-line help, log files, etc. More de- 
tails can be found in the Simbad User's Guide or on the 
Web pages. 



5. User Interfaces to SIMBAD 

There are several user interfaces to Simbad. New users 
are advised to go directly to the WWW interface, unless 
they have very specific needs. 

— The Web interface is currently the easiest access mode 
to Simbad. This interface takes benefit of the WWW 
features to provide the user with additional links to 
internal documentation, associated services (Aladin, 
VizieR), and external archives (currently: the INES 
database of the IUE project; the high-energy obser- 
vational archives at heasarc). Some features, such 
as the finding chart, have been specifically designed 
for the Web and are not available through the other 
modes. 

— The command line interface is the basic underlying in- 
terface to the database, which serves as a basis for the 
other more user-friendly interfaces. During many years 
it was the sole access mode to the database, and many 
users who are accustomed to the commands may find 
it quicker and more versatile. It implies a remote login 
(through telnet) on the simbad machine in Strasbourg, 



(see Section 3.1). 

— A graphical interactive user interface to SiMBAD, 
XSimbad, taking benefit of the X Window environ- 
ment has been developed in 1993-94 for distribution 
to users working in a Unix environment. It is now ob- 
solete, because all the functionalities, and additional 
ones, are more easily available through the Web. 

— A client/server package is distributed on request to 
data managers of archives and information systems, 
when they need to organize the most efficient access 
to Simbad for the resolution of object names into the 
corresponding position, or the retrieval of other infor- 
mation provided by the database, such as the reference 
list for a given object. Written currently in C language, 
it can easily be plugged into any application able to 
access C routines. Distribution is subject to CDS ap- 
proval. 



6. SIMBAD usage 

6.1. Charging policy 

Simbad is a charged service. The telnet access is pro- 
tected by userid/password, and the WWW access is pro- 
tected either by IP address or by password. 

Users have to register, and get a userid/password from 
the CDS staff (or from the U.S. agent for American users). 

In the U.S. there is no invoicing for the end-user be- 
cause the charges are covered globally by NASA for all 
U.S. users. 

In Europe, the same situation is also true for users from 
ESO and ESA member states, thanks to an agreement 
signed with European Southern Observatory and ESA 
Space Telescope European Coordinating Facility (since 
January 1995). 

Special arrangements also exist or are currently being 
negotiated with other countries, including Canada, Aus- 
tralia, and Japan. 

6.2. Usage statistics 

There are currently (November 1999) some 7000 user ac- 
counts from 64 different countries. The development of 
the WWW access makes difficult to keep precise track 
of the individual usage statistics, but the global statis- 
tics show that the world wide interest for accessing the 
Simbad database continues to increase regularly over the 
years. 

The number of Simbad queries evolved from about 
30,000 per month in 1997 to about 100,000 per month 
in 1999. About 50% of the queries come through the 
client/server mode. 

A mirror copy of Simbad has been established at CfA 
(Harvard) for convenience of US users, and about one 
third of the queries are currently processed on the mirror 
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Table 3. Main institutes associated to the CDS for im- 
proving the data contents of Simbad 



Bibliography 

Astron. contents 
Galaxies 

Photometry 

Astrometry 

Binary stars 
High-energy 



Observatoire de Paris, Institut 
d'Astrophysique de Paris, and Ob- 
servatoire de Bordeaux 
GRAAL, Montpellier 
Observatoire Midi-Pyrenees and 
NASA/IPAC Extragalactic Database 
Observatoire de Geneve and Institut 
d'Astronomie de Lausanne 
Astronomisches Rechen Institut, Heidel- 
berg 

Observatoire de Besancon 
Observatoire de Strasbourg 



site (including name resolving activities for the ADS and 
major US NASA archives). The mirror copy is managed 
by CDS and updated every night. 



7. SIMBAD updates and quality control 

7.1. Updating SIMBAD 

Simbad is kept up-to-date on a daily basis, as the result 
of the collaboration of the CDS team, in Strasbourg, with 
bibliographers in Observatoire de Paris (DASGAL), Insti- 
tut d'Astrophysique de Paris, and Observatoire de Bor- 
deaux (Laloe et al. 1993; Laloe 1995| ) who systematically 
scan the articles published in some 100 astronomy jour- 
nals. 

The references are updated very soon after reception 
of the journal issues, and in some cases directly from the 
journal table of contents, through agreements with the Ed- 
itors. New data concerning the objects (identifiers, basic 
data), and new acronyms for catalogs or tables are being 
entered when appropriate. 

The inclusion of a large catalogue in the database is of- 
ten a long-term task which may span over several months 
or years; the collaboration of specialists in the different 
fields is systematically sought. 

The improvement of the Simbad astronomical con- 
tents relies on a network of collaborations: a list of the 
main current contributors is given in Table ^. More gener- 
ally, help of other contributing institutes and authors, too 
numerous to be cited here, is gratefully acknowledged. 



7.2. Quality control 

The data contained in Simbad are also permanently up- 
dated, as a result of errata, remarks from the bibliogra- 
phers (during the scanning of the literature), integration 
of lists and catalogues, quality controls, or special efforts 
initiated by the CDS team to better cover some specific 
domains (e.g., multi- wavelength emitters and complex ob- 
jects). 



Requests for corrections, errata, or suggestions are reg- 
ularly received from Simbad users through a dedicated hot 
line, at e-mail address question@simbad.u-strasbg.fr. 
A few dozens of messages are usually received every week, 
and processed on a daily basis by the member of the team 
who is on duty for that week, or transmitted to the key 
person in case of specialized questions. Remarks received 
from the users by this way are especially welcome, as 
they help the CDS team to improve the database contents 
through the scrutiny of specialists' eyes. 

Developing new tools for quality control of the 
database is a major challenge for the future, and CDS is 
exploring possible solutions. Multivariate analysis applied 
to bibliographic information retrieval has been proven a 
possible tool for developin g qua lity control in a database 
such as Simbad (Lestevcn 1995 ). 



7. 3. Towards automation of updating procedures 

The advent of electronic publishing brings new perspec- 
tives for improvement and automation of the updating 
procedures. 

In a first place, tables of contents of the major jour- 
nals are now received electronically through the network, 
thanks to journal Editors and Publishers, thus reduc- 
ing the risk of errors. Regularly, a number of electronic 
lists of objects are also folded into Simbad through semi- 
automatic procedures. The next step will be the automatic 
flagging of object names in the text of the articles: this has 
now become a very interesting medium-term goal. 

Two ways of achieving this flagging are currently being 
considered: 

— the first one is to ask the authors, with the help of 
the Editors of electronic journals, to flag astronomical 
object names in their text; this can be done, for in- 
stance, by the use of a \object{ } command within 
the TgX or DTgX source, which will be eventually used 
to build an anchor pointing towards Simbad, or an- 
other database, in the on-line version made available 
on the network. This approach has been adopted by 
the Editors of Astronomy & Astrophysics. 

— another approach is the use of intelligent search tools 
for identifying object names within the electronic ver- 
sion of the paper, using a set of syntactic and semantic 
rules, and the Dictionary of Nomenclature as a refer- 
ence database for already known objects. 

The first approach seems safer, provided the authors 
understand what exactly they are being required, and ac- 
cept this (minor) additional work load. The latter im- 
plies a lot of fine tuning from the system developers. 
The current experience with the handling of publications 



(Lesteven et al. 1998) suggests that both approaches may 
be needed, and that a careful quality control, including 
final check by an expert, will probably remain necessary 
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to avoid errors or misinterpretations, and to ensure appro- 
priate completeness. 



8. Nomenclature 

8.1. The Dictionary of Nomenclature 

Designations of astronomical objects are often confusing. 
A complete list of astronomical designations has been col- 
lected and published by Lortet et al. ( |1994| ) in the Dic- 
tionary of Nomenclature of Celestial Objects outside the 
Solar System. 

This information is available on-line through the info 
command, or on the WWW0 This service is the elec- 
tronic look-up version of the Dictionary which is now un- 
der the responsibility of CDS. It is kept up-to-date on 
a weekly basis; about 15 new acronyms are incorporated 
every week. 

The Dictionary currently provides full references and 
usages about some 5000 different acronyms. It is used by 
the International Astronomical Union as a reference for 
its recommendations related to nomenclature. 



8.2. The sesame module 

The sesame module is used inside Simbad for the manage- 
ment of possible variations in the naming of astronomical 
objects. It is based on a list of rules, written as regular 
expressions, allowing translation of the submitted name 
into its Simbad canonical form; it is only made visible to 
the user when a message mentions the submitted syntax 
and its translation. 

There are cases where ambiguities cannot be solved. 
This is actually specific to the broad context of Simbad. 
Let us give an example: in the context of extragalactic 
objects 'N' is a possible abbreviation for 'NGC (accepted 
by NED); but people studying Novae would frequently use 
'N' as an abbreviation for Nova, people studying Hn re- 
gions would use it for naming nebulae in the Magellanic 
Clouds (LHA 120-N or LHA 115-N), and 'N' has also been 
found in the literature for cluster stars studied by Nord- 
lund in NGC 2099 (CI* NGC 2099 N), for stars studied by 
Neckel ([N78]), or even for 'New' parts of the galaxy NGC 
1275 ([NJS93] in Simbad). When a name like 'N 1992' 
is submitted to Simbad the ambiguity cannot be solved 
without requesting additional information from the user. 



9. Integration of SIMBAD into the CDS Hub 

While the CDS databases have followed different develop- 
ment paths, the need to build a transparent access to the 
whole set of CDS services has become more and more obvi- 
ous with the easy navigation permitted by hypertext tools 



such a development, by giving comprehensive simultane- 
ous access to Simbad, the VizieR Catalogue service, and 
to external databases such as NED, using a client/server 
approach and, when possible, standardized query syntax 
and formats. 

In order to be able to go further, the CDS has built a 
general data exchange model, taking into account all types 
of information available at the Data Center, known under 
the acronym of GLU for Generateur de Liens Uniformes 



(Genova et al. 2000). Aladin has become the prototype of 



http:/ /vizier. u-strasbg.fr/cgi-bin/Dic-Simbad 



- Uniform Link Generator (Fernique et al. 199S). 

In the current stage of development, the WWW in- 
terface to Simbad provides access to Aladin previewer 
(reference image around one object), and to the Aladin 
interactive Java program (see Bonnarel et al. 2000). There 
are also links between Simbad and the bibliographic ser- 
vices developed or mirrored at CDS, and more generally 
to the ADS and the electronic journals. 

While this article is written stronger links between 
Simbad and VizieR are just being created allowing even 
easier transfers of data and information between both ser- 
vices. This will also make easier to build new links pointing 
to distributed data archives, beyond those already existing 
(currently: IUE/INES and HEASARC). 

10. Future developments 

In the near future, the CDS team expects to go on enrich- 
ing the database contents and system functionality. The 
users play an important role in that respect, by giving 
feedback on the desired features, on the user-friendliness 
of the interfaces, etc. 

In the context of interoperability of distributed ser- 
vices, as currently discussed within the ISAIA project 
(Interop erable Systems for Archival Information Access; 
Hanisch 2000[ ), Simbad is prepared to deliver resource 
profiles and to format the query outputs in a standard 
way, for instance XML (Ochsenbein et al. 2000). 

As larger and larger astronomical datasets are being 
produced, the CDS is studying the concepts of a new gen- 
eration database of several billion objects, instead of the 
current several million objects. We expect Simbad to re- 
main an essential tool for astronomical research in the 
years to come. 
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